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Abstract 



For any positive integer n, a new family of periodic functions in power series form 
and of period n is used to solve in closed form a class of polynomial equations of 
order n. The n roots are the values of the appropriate function from that family 
taken at 0, 1, n — 1. 



1 Background 



For centuries mathematicians have sought closed-form expressions for the roots of poly- 
nomial equations of arbitrary order n. It is now well known that polynomials of order 
four or less can be solved explicitly using rational operations and finite root extractions. 
The Abel-Ruffini theorem shows that this cannot be done for orders five and above 0. 

Several authors have proposed series solutions of algebraic (and polynomial) equations 
( 13 0, 13 [H])- These solutions, which rely on hypergeometric functions, are cum- 
bersome to implement and have not provided feasible alternatives to standard numerical 
methods. 

For a positive integer n, we will describe here in elementary fashion a family of func- 
tions 



of two real variables t and u that are power series in t and are periodic of period n in 
the variable u. (These functions are analogous to complex Fourier series except that the 
summation extends over positive indices only). 

For a limited class of polynomial equations of order n (parameterized in some way by 
t) the j3' m s will be found explicitly so that the series ^2 \Pmt m \ converges and the values 
of x(t,u) at t and u — 0, 1, ...,n — 1 will be the n roots of the equation. The function 
x(t,u) , which could be thought as "elementary" in the same way 




(1.1) 
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is an elementary function, thus provides explicit solutions simply through its values at 
t and u =0, 1, n — 1. 

This will be only a first step as the class of polynomial equations solved explicitly with 
these power series is limited. The ultimate goal is to generalize the approach proposed 
here to any polynomial equation. 



2 Preliminaries 

We start off with a polynomial equation in the form 

x n = a n _ix n_1 + a„_ 2 x n " 2 + ... + a x x + a (2.1) 

where the a' k s are real or complex coefficients and = pe 10 (p > 0, —it < < it) is as- 
sumed throughout to be non-zero. (Otherwise (|2.1|) can trivially be reduced to an equation 
of degree n — 1) . 

Equation (j2.1j) is transformed by multiplying the right-hand side by t n where t is a 
real variable that we may initially think of as small but is destined to take on any real 
value including 1: 



x n = (an-tx 11 ' 1 + a n - 2 x n ' 2 + ... + aix + a )t n . (2.2) 

We will find a solution in the form of a function x(t,u) given in Eq. JO}. We will 
show that ^2 \P m t m \ converges (and x(t,u) is therefore defined) when t is small enough 
or | cio | large enough, or the modulii \ak\ other than |ao| small enough. 

When the series converges the values of x(t, u) at t and u = 0, 1, ...,n — 1 will provide 
the n roots of Equation (J2.2J) . Indeed, we will show that if we define the partial sums 

<? 

x(t,u) q d ^J2Pme 2uwmxl/n xt m , (2.3) 

m=l 

then for u = 0, 1, n — 1 the polyonomial in t 



E[x{t,u) )= x(t,u) 



Q>n-l 



x(t, u) 



n-1 



+ a n -2 



n-2 



x (ti u )„ + ... + aix(t,u) +a )t n 



(2.4) 



will approach for q — >• oo. 

We begin by seeking the roots expressed as the infinite series 



x{t) = bit + b 2 t 2 + b 3 t 3 



(2.5) 



where a priori we will need n different sequences {b m } to generate the n roots. We will 
find these b' m s and show that in fact they are each of the form 



b m = p m e 2k ^ n ,k = 0,l,...,n-l 



(2.6) 
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required in Eq. (Jl.lj) with a single sequence {/3 m } that will be obtained explicitly through 
a simple discrete dynamical system. 

The equation to solve is now 

x(t) n = (a n _ix(t) n_1 + a n _ 2 x(t) n " 2 + ... + aiz(t) + a )t n . (2.7) 
Before proceeding we need some notations and preliminary results. 

We define the powers B r of the partial sums of x(t): 

B r g d ^(b 1 t + b 2 t 2 + ... + b q t q ) r , q,reN, (2.8) 

where for ease of notation the functional dependence of B r on t is omitted. We let 
K{d,B r ^j denote the coefficient of t d in B r q . (K(d,B^ does not depend on t and 
K (d, B r q ) = 0ifd<rord> r.q) . 

Proposition 2.1. With d>r, the coefficients K(d,B'^j satisfy 

K(d,Bl)=K(d,B r d „ r+1 ) Vg>d-r + l, (2.9) 

and 

min(d—r+l,q) 

K(d,B r q )= J2 b m K(d-m,B r q - 1 ) : r > 2, d > r, (2.10) 

m=l 

9-1 

K(q + s, B°Xl)= b x K{ q + 8-1, + J2 b ™ K (l + s ~™, B Us-n) + b q b{. (2.11) 

m=2 



Proof. Equation ()2.9|) expresses the fact that when q > d — r + 1 then only the first 
d — r+1 b^s enter into K[d, B q ) . Equation (|2.10|) is the convolution rule used to express 
the coefficient of t d in B q considered as the product B r q ~ l x B q . Equation ()2.11|) uses 
()2.9|) to express a form of Eq. ()2.10j) in which all the K(a, B^) s have identical values for 
a and u. 

□ 

With x(t) given in Eq. ()2.5|) both sides of Eq. (J2.7J) are polynomials in t with powers 
> n. The goal is to find recursively the b' m s so that for each p > n the coefficients of 
t p are equal on both sides of ()2.7j) . This will be done by considering the roots x(t) in the 
form of the gradually expanding partial sums B q . The coefficient b\ will be found by 
seeking a solution of the form B\ for which the coefficients of t n on both sides of (12.71) 
are equal. With b\ thus determined, b 2 is found by seeking a solution of the form B\ for 
which the coefficients of t n+1 on both sides of ()2.7|) are equal, etc. 
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We begin with a candidate solution B\ = b\t by setting equal the coefficients of t n on 
both side of Eq. (j2~7|) . i.e. 

B{n,B?) =b n l= a . (2.12) 
Therefore the n possible values of b\ are 

b 1 = p Vn e i(9+2kir)/n k = 0,1, ...,U - 1 (2.13) 

which is Eq. (12. 6|) with m = 1 and 

^dg- p l/n e ie/n ( 214 ) 

We note that the n values x(t) = b±t with b± of Eq. (|2.13jl are the exact trivial solutions 
of Eq. ()2.7|) when all coefficients other than a are 0; otherwise they provide crude first- 
approximation solutions when t is small. 

In order to find 62 with a candidate solution B\ = b\t + bit 2 , we note that the coeffi- 
cient of t n+1 on the left side of Eq. (j2HI) is K(n + 1,B%) and on the right is axK{\, B\). 
Therefore we want 

K(n + 1,5") = ai K(l,Bl). (2.15) 
Similarly, equating the coefficients of t n+2 with a solution B\ yields 

K(n + 2,B! 3 l ) = a 2 K(2,B 2 3 ) + ai K(2,Bl). (2.16) 

When we equate the coefficients of t n+3 a similar expression arises with a third term 
involving the coefficient a 3 . In general, to equate the coefficients of t n+q on both sides of 
Eq. (|2.7J) (with a candidate solution B q+1 ) one needs: 

min(n—l,q) 

K(n + q,B n q+1 )= £ a m K(q, B? +1 ) q = 1, 2, ... (2.17) 

m=l 

Equation (|2.9j) shows that for any s > 1 

ff(n + g,B£.i) =K(n + g,i?; +s ) (2.18) 



= K(g,5 g m _ m+1 ) = (2.19) 
These equations show that ()2.17|) implies for any s 

min(n~l,q) 

K(n + q,B^ s )= £ 0,^(9, B£j. (2.20) 

m=l 

Therefore if ()2.17|) is satisfied, the coefficients of t n+q on both sides of Eq. (|2.7|) are also 
equal when any number of terms bkt k are added to the partial sum B^ +1 . 

We next write the K(a, B") s appearing in Eq. ()2.17|) in such a way that the indices 
a and u are equal. Equation ()2.9|) shows that Eq. (|2.17|) is equivalent to 
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min(n— l,q) 

K(n + q, B n n+q ) = £ a m K(q, Bfj q = 1, 2... (2.21) 

m=l 

Given that b q = K[q, B^j, the task will be to find n sequences of b' m s for which the 
infinite system (J2.21)) is satisfied for all q. We will achieve this through a simple discrete 
dynamical system that generates the V s. A change of variable on the dynamical system 
will show that the n sequences {b m } are of the form j/3 m e 2fc7rm>a / n } required in Eq. (jl.lj) 
(with k — 0,1, — 1) . Finally we will give the convergence conditions for the series 

\Pmt m \ and show that the x(t, k) s are the n roots in the sense of (|2.4jl . 

3 Matrix formulation 

We first define the sequence of n — 1 dimensional vectors 

V{qp(K(q,B\) K(q+l,B 2 q+1 ) ... K(q + n - 2, flj£_ 2 ))' g = l,2... (3.1) 
where the apostrophe means "transpose" so that V(q) is a column vector. We note that 

V(l)={b l b\ b\ ... ^r 1 )'. (3.2) 

Also the s - th component V(2) g of V(2) is K[s + 1,B* +1 ) = s.b 2 b s f l with b 2 to be 
determined from Eq. 1)2.21)1 for q = 1 : 

n.b 2 b\~ l = K(n + l, B r ,l +1 ) = a x h (3.3) 
from which b 2 = a,\j (nfr™ -2 ) and therefore 

V(2) = (a^nb^- 2 laxjnb^ (n - 1) adrift?)'. (3.4) 

We next define the sequence of n — 1 dimensional vectors X(q) obtained as the con- 
volution of the 6-s with the vectors V(q) : 

X{q-l) de d-Y^b m V{q-m+l) g = 3,4,.... (3.5) 

m=2 

We let V^(g) r and X(q) r denote the r — th components of V(q) and X(q). The left-hand 
side of Eq. (l2~TT| is V(q) 8+1 and Eq. (l2~TTJ) with s = 1 is then 

V(q) 2 = hViq), + X(q - 1\ + Viq)^ = 2b 1 V(q\ + X(q - l) r (3.6) 
With s = 2 Eq. flUD) is 

V(q) 3 = hV(q) 2 + X(g - 1) 2 + V{q\bl (3.7) 
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and more generally 



V(q) s+1 = b 1 V(q) s + X(q - l) s + V{q)&, a = 1, 2, n - 2. 



Equation (|2.11|) with s = n — 1 is 

K(q + n-l, B n q+n ^) = + X(q - 1)^ + ^g)^ 1 . 

If we define the n — 1 dimensional square matrix 



/26i 


-1 


... 


o\ 


6? 


bl -1 


... 







6i 


-1 ... 







... 


&i 


-1 


v&r 1 


... 




bl) 



Mi 



then Eqs. (|3.6j) - (|3.9|) (for s = 1, 2, n — 1) can be written compactly as 

/ \ 



-X(q-l) = M 1 V(q)- 



\K(q + n-l,B- +n _i)J 



(3i 



(3.9) 



(3.10) 



(3.11) 



The goal is to express V(q) as a function of past vectors V(q — 1), V(q — 2), V(l). The 
inverse of Mi is 



/ & n ~ 2 fe^ 1 
-(n - 2)& n ~ 1 2& n " 2 26r 3 
-(n-3)6J -(n-S)^" 1 3&r 2 



6? 



h 

2bi 

3bl 



1 \ 

2bi 

3bl 



-2b? 



2n-5 



<2n-6 



2n-4 



-2&f 



26™" (n-2)b n i" z {n-2)b n i 



vn— 3 



-&f- D -ft?" 1 (n - i)&rv 

(3.12) 

If (M x x ) . . is the entry in the i-th row, j-th entry of this inverse, Eq. ()3.12|) can be written 

as 



(Mf) 



■(n — i) J 2 if i > j 



(3.13) 



We multiply both sides of Eq. ()3.11|) by M 1 1 to obtain 

/ 

V(q) = -Mi X X(q - 1) + Mf 1 



(3.14) 



\^(? + W-l^?+n-l)/ 
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From Eq. (|2.21|) we note that 

m£n(n— 1) 

K(q + n-l,B^ n _ 1 )= £ a m K(q - 1, B™ J (3.15) 

m=l 

min(n— l,q— 1) miu(ii-l,}-l) 

5^ a m ^(g-m) m = ^ [A m .V(g - m)] (3.16) 

m=l m=l 

where each A p (p — 1, 2, n — 1) is an (n — l)-dimensional row vector with a p in p — th 
position and zeros elsewhere. (Here and elsewhere an expression in square brackets will 
generally represent a scalar product). 
The vector V(q) of Eq. ()3.14|) is now 



V(q) = -M^X( 



D + 



min(n— l,q— 1) ( 

E [A m .T%-m)] 

m=l 



rife? 



n-l 



1 

2&i 
3bj 



(3.17) 



V(n - 1) VT'J 



For each one of the n values p 1 / n e t ( e '+ 2fc7r )/ n ( A; = 0, 1, n — 1) of &i the sequence 
^(g) yields all the subsequent fe^s since hi is the first component of V(i). We thus have 
the n sequences that will provide the n roots (under the right as-yet-unproved con- 
vergence conditions). 

A change of variable on the sequence V(m) which defines a new sequence W(m) 
will now show that the n values of each b m can be expressed in the form p m e 2k7rmx ^ n 
{k = 0,1, ...,n — 1) where a single sequence {/3 m } is defined recursively as a function of 
the a' k s . 



We will first need the diagonal matrix 



n-l 



B 



def. 



b r i 







and 





n-2 



0\ 



&™" 3 



b\J 



(3.18) 



n 



( 1 1 ... 

-(n-2) 2 2 

-(n-3) -(n-3) 3 

-1 -1 ... 



.. 1 \ 

2 
3 

■1 n-l/ 



(3.19) 
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The matrix M is the inverse M x 1 of ()3.12|) in which b\ would be set to 1. With these 
notations 



Mf 1 = ^-B~ l M B 
The sequence W(q) derived from V(q) is defined as 

W{qyy-b\-*B.V{q), g = l,2,.... 



(3.20) 



(3.21) 



We will see that the sequence W(q) is independent of the particular b± chosen among 
its n possible values p 1 / n e H + 2fc7r )/« (fc = 0,l,...,n — 1). The fact that each one of the b' m s 
is of the form p m e 2k7rmxi ^ n (k = 0, 1, ...,n — 1) follows then from the fact that the first 
components V(q) 1 and W(q) 1 satisfy 



V(q\ = Wiq^bl/ao. 



(3.22) 



In the theorem below we give for the sequence W(q) an expression analogous to (|3.17j) 
for V(q) and provide the expressions for the P' m s. 



Theorem 3.1. The sequence W(q) is defined as 

W(l)=a (l 1 .. 



1)' 



W{2) = °±(1 2 



n 



(3.23) 
(3.24) 



q— 2 min(n—l,q—l) 

W(q) = — -MY^[u.W(p + l)]W(q-p) +U [A v .W{q-v)\ 
p=i p=i 

(3.25) 

where: 

1. u is the (n-1)- dimensional row vector having 1 in first position and zeros elsewhere. 

2. U is the (n-1)- dimensional column vector ^(l 2 n — l) . 

The 0' s that define the n sequences 



are 



b m = P m e 2k7Tmxl/n , k = 0, 1, n - 1; m = 1, 2, ... 



/3 m = p m l n x e i{me)/n x W{m)Ja Q , m = 1, 2, 



(3.26) 



(3.27) 



Proof. The expressions of (I3.23|) - ()3.24j) follow from the definition in Eq. ()3.21|) and Eqs. 
(j32J) and which define 1/(1) and V(2). 

Using Eq. ()3.5j) to express X(q — 1) appearing in V(q) of Eq. ()3.17j) yields 



'9-1 



V(q) = bl^B^Wiq) = -Mf 1 ^bJ/T™ B^W (q - m + 1) + 



\m=2 



min(n— l,g— 1) 

£ 

p=i 



^.(ftT^'VCg-p) 



n-l 



1 

26i 
36? 

\(n - 1) 6TV 



g = 3,4, 



(3.28) 



Bearing in mind that: 

* B x M^x-B" 1 = M/6i; 6? = a 

* each b m is 6™ _n multiplied by the first component [u.W(m)] of W(m) 

* the p — th component of B^ 1 W(q — p) is b^~ n W(q — p) = b^W(q — p) j do, 
we multiply both sides of Eq. ()3.28j) by b\" q B to obtain 



W(q) =- 
a 



q—l min(n—l,q—l) 

-MY^[u-W(m)]W(q-m+l) + U [ A P -W(q-p)\ 

p=i 



m=2 



g = 3,4, 



(3.29) 

which is Eq. (I3.25J) once the index m in the first sum is set equal to p + 1 with p going 
from 1 to q — 2. 

In view of Eq. ()3.22|) the first component b m of V^m) is equal to 



V?W{m)Ja Q = p m/n x e m{e+2k7T)lln W{m)Ja 



which is (|3.2fjjl with f3 m given in Eq. (J3.27j) . 



(3.30) 
□ 



4 Convergence results 

In order to establish convergence conditions for the series of (j2.f>j) we need to assess the 
growth of the first components W(m) 1 of the vectors W(m). We will use the norm 
l^loo = maa; |^i| °f a complex vector V— {Vj). We also define the row-sum norm 



|A|| = maxi 1 I 



(4.1) 



of an m— dimensional square matrix A = (a^) and recall that |^4.V| < ||A|||V| C 
If we define the function of two variables 

z(u, n) = —2u 2 + u(2n + 1) — n 
then the norm of the (n — 1)— dimensional matrix M of Eq. (j3.19j) is 

1 



Ml 



n 



max(z[floor(n/2 + 1/4) , n], z[ceil(n/2 + 1/4) , n] 



(4.2) 



(4.3) 
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where the floor (x) and ceil(x) functions are the largest integer smaller than x and the 
smallest integer less than x. 

We next define the sequence of modulii 

w(q)^\W(Q)L ,9 = 1,2,... (4.4) 

which are upper bounds for the modulii of the W(q)' 1 s of interest. We also define the 
maximum modulus of the coefficients other than ao : 

a d = max la J. (4.5) 

k=l,2,...,n-l 

We note that w(l) = \ao\ and w(2) < a. From Eq. (|H.25jl we now have 



q—2 min(n—l,q—l) 

(ti-n \^yj(p+l)\\M\\xw{q-p) + a w ti- P) ' 9 = 3,4,... (4.6) 



w 

\a 



p=i p=i 



with an important distinction to be made between the cases q < n and q > n. In the 
former case the p in the second sum on the right-hand side of (|4.6J) goes to q — 1 with a 
last term w(l) = \a \. The inequality of f)4.6|) is then 



w{q) < -r^-A Y\\\M\\w{p+l) + a]w{q-p) ) + a, g = 3,4,...,n (4.7) 
l a °l \ P =i J 

We next define 

Ai=||M||/|ao| (4.8) 



and the sequence of functions 



a/\\M\\, if p<n 



^ = rM/;>7 '7^ = 2,3,.. (4.9) 

In the case q > n the index p in the second sum on the right-hand side of ()4.6)1 goes to 
n — 1 and (|4.fi|) can now be written 



w 



q-2 

(q) </x( 34+1) + ^i(«)W9-p) ) (4-10) 
p=i 



< /x^[w(p+ 1) + a p+ i(a)][w(g-p) + a,_ p (a)] j , g = n + 1, n + 2, ... (4.11) 

where the purpose of this last inequality is to bound the sequence {w(q)} by a particular 
type of discrete convolution which will be considered below. 

We first define recursively a sequence of nonnegative functions S q (a,\a \) : 

S 2 (a,\a \) d =a + a 2 (a) (4.12) 
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de 1 ( q ~ 2 \ 

S q (a, \a \) = - — | V"[||M||S , p+ i (a, \a \) + a]S q - p (a, |a |) + a + a q (a) ,q = 3,4, ...,n 
l a °l \ P =i J 

(4.13) 

where the functional notation emphasizes for future reference a dependence of S q on 
a and \a \ (even though S 2 (a:, \a \) of (|4.12jl does not depend on |a |). 
We next continue the sequence of S's with 

S 9 (a, |a |) =>( y^^p + i(a, |a |)5 9 _ p (a, |a |) I , g = n + 1, n + 2, ... (4.14) 

Lemma 4.1. With the notations given above, 

w(q) <w(q) + a q (a) < S q (a,\a \), g = 2,3, ... (4.15) 

Proof. The inequality of (|4.15jl is true for q = 2 because w(2) < a. In view of (|4.7|) and 
(|4. 13jl it is then also true for q = 3,4, n. For g = n + 1, ()4.11|) yields 

(n-l 
y^jwjp + 1) + cr P +i(«)] k + l-p) + a n+ i- p (a) 
P =i 

(4.16) 

< [J>y52 S P+i( a i \ a o\) x S'n+i-pCa, l°o|)J = S n+ i(a, |a |). 

To prove the result by induction we assume ()4.15|) is true up to order n + r. Then at 
order n + r + 1 we have 

w(n + r + 1) = io(n + r + 1) + tr n+r+1 (a) (4.17) 

(n+r—l \ 
J] [w(p + 1) + a p+ i(a)] [iu(n + r + 1 - p) + <x„+ r+ i_ p (ct)] J (4.18) 

(n+r-l \ 
^ Sp+iCa, |«o|) x S' n+r+ i_p(a, |a |) J = SV l+r+ i(a, |a |) (4.19) 

which completes the proof. 



□ 



We now provide some definitions and results pertaining to convolution-type sequences 
such as the S' q s of (jHHJ). 
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Proposition 4.2. For any scalar fi, a fi— convolution of order m is an infinite sequence 
{u k } consisting of m initial scalars u\, u?, ■■■■,u m with subsequent terms defined as 



u q 



'9-1 

jU I yu p u q ^ p I , q = m + l,m + 2, ... (4.20) 



For a fi— convolution {v q } of order m = 1, we have 

v q+l = ViC q (v lf i) q ,q = 0,l,... (4.21) 
where the C' r s are the Catalan numbers 

C r = . , ,r = 0,l,2,...; C r ~ -S-^ for r ^ oo. (4.22) 

For a \x— convolution of order m with fi > consisting of m initial nonnegative terms 
u\,U2,...,u m we define 

Vi = max ( — ^— r I • (4.23) 

fe=l,2,...,m \ C/ c _i/i'"~ 1 / 

T/ie sequence u k is then bounded by the /i- convolution {v k } of order 1 and initial term Vi, 
i. e. for any q : 

u q+ i < v q+1 = v\ +l ^C q ~ ^f+V 9 -^ /or g -> oo. (4.24) 



Proof. One of the denning relations for the Catalan numbers is 

C s = C C s -i + dC s -2 + - + CUCo, s = 1, 2, ... (4.25) 

Equation 1)4.21)1 follows then from a straightforward proof by induction and the asymp- 
totic form of 1)4.22)1 results from Stirling's formula The result of 1)4.24)1 is a direct 
consequence of ()4.23|) which states that u k < v k for k = 1,2, ...,m. 

□ 

We will now use these results to assess the radius of convergence of the power series 
Theorem 4.3. A lower bound for the radius of convergence of £ f3 m t m is 



^f. 1 / / S k+1 (a,\a \) \\M\\ 

- max : 

4U=i,2 I ...,n-i\ C k ^(\a \y 



LBRC(a, \a Q \) = - ( _ max i [ — „ fc/n ) (4.26) 
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Proof. We first define 

t i \\ de f- f S k +i{a,\a \)\ 1/k , s 

v 1 (a,\a \)= max — , 4.27 

fc=l,2,...,n-l y Cfc„i/i ft 1 / 

and bear in mind that the /x— convolution S q (a, | a-o | ) of order n — 1 starts at g = 2. There- 
fore 62(0!, l a o|) is M i °f Proposition 14.21 and more generally S m (a, |a |) is w m _i. The re- 
sults of ()4.15jl and ()4.24jl then yield for any m 

w(m) < S m (a, \a \) < Vl (a, laol)™" 1 x p m ~ 2 x C m _ 2 (4.28) 

Am— 2 

~ |a |) m -V m - 2 ^ ^ M m - 00. (4.29) 

v/7r(m - 2) 7 

The series \/3 m t m \ converges if the sequence < |An| l7m |^l \ can be bounded by a 

L J m=l,2,... 

sequence converging to a limit <1. Given Eq. ()3.27|) and the fact that p = \a \, we have 

\Pm\ 1/m \t\ = p l/n - l/m \W{m) x \ 1/m \t\ (4.30) 



4|t| x ||M|| x^(a,|a |) f s fc+ i(<*, |oq|)||M|| \ V * 

~ — n = 4m max — ^ — tor m — > oc 

(laol) 1 " 17 ™ W,2,...,n-i^ C7 Jk _ 1 (|a |) 1 -* /B J 

(4.32) 

which yields the desired result of f)4.26j) . 



□ 

We now know that ^2/3 m t m has a positive radius of convergence which we call RC(a) to 
emphasize its dependence on the vector a = (a^) of coefficients. We do not have an 
analytical expression for RC(a) but we do have the lower bound LBRC (a, \a \). 
For any \t\ < RC(a) we may then define 

e g.«/M7( B ) + i < l (4 33) 

in which case there exists A > such that 

\b m t m \ = \K(m,B l m )t m \= \p m \\t\ m < Ac m , m = 1,2,... (4.34) 



In order to show that when the series ^x(t,k) = ^ p m e 2k7rmx ^ n t m converge they 
provide the n roots of Equation (J2.2|) . we need to prove a result on the growth of the 
K(d,B p q+1 )'s. 
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Lemma 4.4. // \t\ < RC(a) then there exists A > and c (0 < c < 1) such that 

\K(d,B p q+1 )\<AP(q+ir\c/\t\) d , W>p>l, q>0. (4.35) 



Proof. We prove the result by induction on p. Equation 1)4.34)1 shows that if d < q + 1 
then 



K(d,Bl +1 )\ = \p d \<A(c/\t\Y 



(4.36) 



which proves ()4.35|) for p — 1; (J4.35)) is trivially true if d > q+ 1 since then K[d, -Bg+i) 
0. For p = 2, Eq. (I2~TU1) yields 



\K(d,B 2 q+1 )\ 



min(d—l,q+l) 

b ™ K { d 

- m, Bg +1 ) 



m=l 



< 



(4.37) 



9+1 



9+1 



^|& m ||#(d-m,£j +1 )l < J2Mc/\t\) m A(c/\t\) d - m = A 2 (q + l)(c/\t\) d . (4.38) 



m=l 



m=l 



We now assume that ()4.35|) is true up to order p < d and calculate | K {d, -Bg+i ) | • Equation 
(gHH) yields 



mm(d— g+1 
m=l m=l 



9+1 



< 5>(c/|t|)*M*( g + lr^c/ltl)^ = ^ +1 (g + l) p (c/\t\) d 

m=l 

which is the desired result. 



(4.39) 



(4.40) 



□ 

In the next section we bring together previous results and prove that when it converges, 
the power series x(t, k) provides the roots of Equation (12.2(1 . 



5 Main result 

Theorem 5.1. We consider the following polynomial equation, parameterized by t > 
and with qq = pe ld ^ : 



x 



n = (a n ,_ix n 1 + a„_ 2 x™ 2 + ... + aix + a )t n . 



(5.1) 



We define ot= max |afc| and recall the definition of the matrix M of ()3.19|) . We 

k' = - X,2,...,Ti 1 

i/ien define n — 1 numbers Sfc(a, |a |) recursively as follows: 



S 2 (a, |a |) d ='a + a/||M||, 



(5.2) 
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'9-2 



S q (a, looD^'-j^-r ( V][||M|| ( S , p + i (a, \a \) + a]S q ^ p (a, \a \) ) + a + a/||M||,g = 3,4, ...,n 

(5.3) 

We also define recursively the infinite sequence of (n — 1) dimensional vectors W(q) : 
W(l) = a (l 1 1)'; W(2) = —(l 2 n-l)' (5.4) 



W(q) = - 
a 



q—2 min(n—l,q—l) 

-M^2[u.W(p + l)]W(q-p) + U [ A p- W (<1-p)] 
p=i p=i 



■ ? = 3,4, 



(5.5) 
where: 

1. u is the (n-l) -dimensional row vector having 1 in first position and zeros elsewhere. 

2. U is the (n-l) -dimensional column vector ~(1 2 n — 1) . 

3. Each A p (p=l,2,...,n-l) is the (n — 1)- dimensional row vector with a p in 
p — th position and zeros elsewhere. 

If we define the sequence 



f] m = p™/« x e i(m9)/n x ty( m y a0; m = i, 2, 
then \/3mt m \ converges for \t\ < RC(a) and in particular for 



\t\ < LBRC(a, |a |) 



def. 1 



max 



S k+1 (a, |a |) \\M\ 



l/k s 



-i 



(5.6) 



(5.7) 



If \t\ < RC(a) the n convergent series 

oo 

x{t,k) = J2^me 2knmxi/n xt m , fc = 0,1, ...,n-l (5.{ 

m=l 

are the n roots of Eq. (|5.1j) in the sense that for each k = 0, 1, n — 1 i/ie expression 



E[x(t,k)W\x(t,k) c 



n-l 



1 n-2 



a n _i^(i,fc) 9 +a„_ 2 + ... + axx(t, k) q + a J^" 

(5.9) 

approaches /or g — ► oo. (T/ie x(t, fc) ? s are £/ie partial sums up to order q (Eq. ()2.3|) j. 
Proof. Given Eq. f|2.17|) the polynomial in t on the right-hand side of (|5.9|) only has terms 



of the form t n+q+s with s > 0. We will first show that 



x(t, k) t 



contributes a series of 



such terms that approaches when q — > oo. Indeed, ()4.35|) shows that the modulus of 
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the contribution of 



x(t,k) 



to the terms t n+q+s is 



q(n— 1)— n 



J2 t n+ « +s K(n + q + s,B%) 



s=0 



< 



s=0 



\n+q+s A n .„n—l 



A n q n -\c/\t\) 



n+q+s 



(5.10) 



A n q n-l c n + qJ2 c s 



(5.11) 



which approaches when q — > oo. 

r ~\p 

We similarly consider the modulus of the contribution of each x(t, k) q of (J5.9|) 

(for 1 < p < n — 1) to the series of terms t n+q+s . Given the term t n appearing on the 
right-hand side of ()5.9j) . this modulus is 



J2t q+s K(q + s,Bv) 



s=0 



< 



Y)t\ q+s A^-\c/\t\) 



q+s 



s=0 



A p q p - l c q ^c s 



(5.12) 



(5.13) 



which also approaches when q — > oo. (Because = 6" the term aot n on the right-hand 

-i n 

side of f!5.9|) cancels out with the term b™t n appearing in the expansion of x(t, k) q .) 

r i p 

This completes the proof since each one of the x(t, k) q terms on the right-hand side 
of E^x(t, k)^ approaches when q — > oo. 



□ 



Remarks: 

1. The functions x(t, k) of Eq. ()5.8|) can be viewed as Taylor expansions in the variable 
t of the roots of Equation ()5.1|) : when t is small the first few terms of the series provide 
approximate values for the roots (a numerical example is given below) . 

2. In view of ()5.6|) the radius of convergence RC(a) of x(t,u) as a power series in 
t can be calculated numerically as 



RC(a) = Urn inf Q(m) 



(5.14) 



where 



def. 



(5.15) 



3. Each term S q (a>, \ao\) is a polynomial in a with no constant term and positive 
coefficients. This insures that LBRC(a, \ao\) tends to infinity for a — > . Therefore 
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(|5.7j) is satisfied for a small enough which shows that the functions x(t, u) provide the 
roots when the coefficients other than a are small enough. 

4. Each S q (a, |a |) is a decreasing function of \a \ that approaches a + a/||M|| when 
| cio | —> oo. With (|ao|) 1_fc ^ n in the denominator of (|5.7jl . the function LBRC(a, \ao\) 
tends to infinity for |a g | — » oo. Therefore the functions x(t,u) provide the roots when 



\a \ is large enough. 

As a numerical illustration we consider the equation 

x 6 = [-x 5 + x 4 - 2x 3 - 3x 2 + 2x + 8)t 6 (5.16) 

with t set equal to 1. The lower bound LBRC(a, | oe.q | ) for the radius of convergence of 
the power series x(t, u) is 0.094. Therefore we do not know whether x(t, u) converges with 
t — 1. However LBRC(a, \ao\) is an extremely conservative bound calculated only with 
a, | do | and ||M||; LBRC(a, |ao|) thus represents a worst-case scenario on the coefficients 
dfc. For these reasons the condition \t\ < LBRC(a, |ao|) of (|5.7j) is probably of limited 
practical use. Its usefulness lies in the qualitative Remarks 3 and 4 above, which state 
that the function x(t, u) does provide the roots for a small enough or |a | large enough. 



a) Real and imaginary parts of function x(t,k) and of numerically calculated roots 




— Re(x(t,k)) 
- Im(xit.k)) 
* Re(root) 
o Im(root) 



9 10 11 12 



b) Sequence Q(m) and parameter t=1 




Figure 5.1: Function x(t,k) (with t=l) and sequence Q(m) 
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From Figure 5.1b the radius of convergence RC(a) assessed numerically as the limit 
of Q(m) appears to be about 1.05. Therefore the power series x(t,u) does converge 
with t — 1. (A few missing values in the Q(m) sequence arise when W(m) 1 = 0, i.e. 
Q(m) = oo.) 

The real and imaginary parts of x(t,k) are plotted in Figure 5.1a over two periods 
of length n = 6 (200 terms are used to calculate the series). These two parts taken 
at t — 1 and k = 0,1,2,3,4,5 coincide with those obtained using Matlab's built-in 
polynomial equation routine to solve Eq. ()5.16j) with t = 1 (stars and circles in Figure 
5.1a). 

With real coefficients for the polynomial equation the complex roots come in conjugate 
pairs. There are two such pairs. One for k = 1, k = 5 and the other for k = 2, k = 4. In 
addition there are two real roots at k = and at k = 3. In this and some other cases these 
real roots appear to be at local minima or maxima of the Re(x(t, k)) function. These 
special behaviors of and relationships between the real and imaginary parts no doubt 
arise from particular patterns in the sequence {W / (q , ) 1 } which have yet to be explored. 
These structures disappear with complex coefficients since in this case there are no more 
complex conjugate roots. 

The first five terms of the series x(t, u) are 

x(t,u) « V2e 2unxi/6 t + 0.08333e 4u " xi/6 t 2 - 0.18414e 6u " xi/6 t 3 

-0.14506e 8 "^ xi/6 t 4 + 0.11441e 10uwxi/6 t 5 . (5.17) 

There are not enough terms to calculate the roots when t — 1. With t = 0.4 however, the 
values provided by ()5.17j) are very close to those calculated numerically (Table l5~Tf . 



Table 5.1: x(t,k) and numerically calculated roots of Equation (5.16) with t=0.4 





k=0 


k=l 


k=2 


k=3 


k=4 


k=5 


x(0.4,k) 


0.565 


0.290+0.504i 


-0.300+0.474i 


-0.545 


-0.300-0.474i 


0.290-0.504i 


num. root 


0.564 


0.290+0.504i 


-0.301+0.474i 


-0.546 


-0.301-0.474i 


0.290-0.504i 



If the constant term ao = 8 m Eq- ()5.10j) is less than approximately 4, the radius of 
convergence for the series x(t,u) drops below 1. The method can no longer be used to 
solve Eq. (|5.16|) with t — 1. Several approaches have been tried in order to extend the 
method to the case ao small in which at least one root becomes close to 0. One possibility 
would be to transform the unknown in such a way that small roots are moved away from 
0. For example one could write the polynomial equation in terms of a changed unknown 
y — 1/x: if a root x is small then the corresponding y is large. However this approach 
did not change the problem. Another more promising possibility would be to inject the 
parameter t differently into the equation. For example one could use a similar approach 
after multiplying the left side of Eq. (|2.1j) by t n instead of the right side. Or one could 
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multiply each term of the equation by a power t p with a different and well-chosen p for 
each term. To date such attempts have proved largely inconclusive. 

Ours is only a first step, which shows that the roots of a particular class of polynomial 
equations can be expressed explicitly with an infinite number of rational operations and 
root extractions. It is to be hoped that some variant of the family of periodic functions 
x(t, u) will eventually emerge to provide closed-form expressions for the roots of arbitrary 
polynomial equations. 
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